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MAMMAGLOBIN, A SECRETED 
MAMMARY-SPECIFIC BREAST CANCER PROTEIN 

Related Applications 

This application is a continuation-in-part of PCT/US96/08235, filed May 
31, 1996 which is a continuation-in-part of U.S. Application No. 08/455,896, filed 
5 May 31, 1995. 

Background of the Invention 

(1) Field of the Invention 

This invention relates generally to the field of breast cancer pathogenesis 
and, more particularly, to a cDNA sequence and encoded mammary-specific protein 
1 0 for use in detecting and treating breast cancer. 

(2) Description of the Related Art 

Breast cancer is one of the most common and potentially lethal of cancers. 
Although early diagnosis and treatment can reduce morbidity and 
mortality related to the disease, the positive predictive value of mammography has 
15 been estimated to be only about 25% (Hall et al., NEnglJMed 327:3 19-328, 1992 
which is incorporated by reference). It would, therefore, be desirable to have a means 
for detecting the cancer earlier than the cancer can be detected using mammography 
and a genetic or biochemical marker might be able to provide such means to 
♦ complement and increase the predictive value of mammography. (Hayes, Hematol 
20 Oncol Clin N Am 5:485, 1 994 which is incorporated by reference). 

The development of breast cancer is accompanied by a number of genetic 
changes (For review see Porter- Jordan, Hematol Oncol Clin N Am 5:73, 1994 which 
is incorporated by reference). Such changes include gross chromosomal alterations 
and loss of genetic markers (Devilee et al, Biochim Biophys Acta 1198:113, 1994; 
25 Callahan tt al, J Cell Biochem Suppl 77:167, 1993 which are incorporated by 
reference). The progression of breast neoplasia has also been shown to result in 
qualitative and quantitative changes in expression of previously identified genes that 
encode growth factors and their receptors (Zajchowski et al., Cancer Res 45:7041, 
198& which is incorporated by reference), structural proteins (Trask et al, Proc Natl 
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Acad Sci 57:2319, 1990 which is incorporated by reference), second messenger 
proteins (Ohuchi et al., Cancer Res 26:25 1 1, 1986 which is incorporated by 
reference), and transcription factors (Harris, Adv Cancer Res 59:69: 1992 which is 
incorporated by reference). These changes in gene expression could potentially form 
5 the basis for developing a breast cancer marker, although the precise role of these 
gene changes in the pathogenesis of breast carcinoma in patient biopsy samples is not 
well understood. 

In addition to providing a genetic or biochemical marker for breast cancer 
for early detection of the disease, it would also be desirable to have a tumor marker 

10 that might provide an estimation of prognosis, a means for selection and evaluation of 
therapy and a means for the targeting of therapy. Although a number of tissue 
markers have been identified, none are sufficiently sensitive or tumor specific to be 
ideally suited for diagnosis or for screening the general population. (Id). Thus, there 
remains a continuing need for a breast cancer marker such as a gene along with its 

15 expressed protein that can be used to specifically and selectively identify the 

appearance and pathogenic development of breast cancer in a patient, and that can be 
used in tumor-specific immunotherapy. 

Using a modified differential display polymerase chain reaction technique 
to isolate differentially expressed sequence tags from mammary carcinoma, several 

20 sequence fragments were isolated that were uniquely expressed in neoplastic 
mammary epithelial tissue as compared to normal tissue controls (Watson and 
Fleming, Cancer Res 54:4598-4602, 1994 which is incorporated by reference). The 
discovery of one of these sequence tags identified as DEST002 has led to the 
discovery and isolation of the novel full length cDNA and encoded protein now 

25 referenced as mammaglobin. The cDNA and protein are both new. 
Summary of the Invention 

Briefly, therefore, the present invention is directed to the identification of 
novel genes whose expression is increased in breast cancer and to the isolating of 
cDNA's from the inRNA's of these genes. Accordingly, applicants have succeeded in 

30 discovering a novel cDNA and the encoded mammary-specific secretory protein, 
mammaglobin. The cDNA is in purified and isolated form and has a nucleotide 
sequence identified as SEQ ID NO:l and the encoded protein, mammaglobin, is in 



WO 99/14230 PCT/US98/1 7991 

3 

purified and isolated form and has an amino acid sequence identified as SEQ ID 
NO:2. 

Mammaglobin is overexpressed in 27% of stage I primary breast cancer 
tumors. This suggests that dysregulation of the mammaglobin gene occurs early and 
5 frequently in breast cancer. The discovery of mammaglobin and its cDNA, therefore, 
provide the basis for the development of novel methods and compositions for the 
detection and treatment in humans and other mammals. 

Thus, the present invention is directed to novel methods for detecting the 
presence of breast neoplasia cells in a sample. In one embodiment, cDNA encoding 
1 0 mammaglobin or a derivative of said cDNA is used to detect the presence of 

mammaglobin mRNA in a sample. The method comprises the steps of: (a) providing 
a polynucleotide containing a nucleotide sequence having the sequence of SEQ ID 
NO: 1 or a derivative thereof, (b) incubating the nucleotide sequence with the sample 
under conditions in which the sequence can hybridize with mRNA from breast 
1 5 neoplasia cells, and (c) detecting the existence of a DN A-RNA hybridization 
complex. 

Another aspect of the present invention provides a kit for detecting the 
presence of breast neoplasia cells in a sample by hybridization. The kit comprised a 
polynucleotide containing a nucleotide sequence having the sequence of SEQ ID 

20 NO: 1 or a derivative thereof packaged in a container. 

In another embodiment of the present invention, mammaglobin expression 
in a sample is determined by detecting the presence of cDNA that is reverse 
transcribed from mammaglobin mRNA in the sample. The method comprises the 
steps of: (a) producing a cDNA encoding mammaglobin from mRNA using the 

25 reverse transcription method in a sample obtained from a patient, (bj providing two 
primers for the polymerase chain reaction method which comprise oligomers that 
flank or lie within the cDNA encoding mammaglobin, and (c) amplifying the cDNA 
encoding mammaglobin by the polymei^e^ The two primers 

have nucleotide sequences comprising SEQ ID NO:3 and SEQ ID NO:4. 

30 Another embodiment to the present invention provides a kit for detecting 

the presence of breast neoplasia cells in a sample by the polymerase chain reaction. 
TIiq kit comprises two primers for the polymerase chain reaction method which 
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comprise oligomers that flank or lie within a cDNA encoding mammaglobin 
packaged in a container. The two primers have nucleotide sequences comprising SEQ 
ID NO:3 and SEQ ID NO:4. 

In another embodiment of the present invention, the presence of 
5 mammaglobin protein expressed by a tumor cell is detected in a sample using specific 
antibodies to the mammaglobin protein. The specific antibodies can be polyclonal or 
monoclonal antibodies. 

The invention is also directed to novel compositions and methods for 
treating breast neoplastic disease using mammaglobin antigens capable of inducing an 
10 antibody-mediated and/or a cell-mediated, i.e., through activated T cells, immune 
response against a mammaglobin-expressing tumor. 

One embodiment of a composition according to the invention comprises a 
mammaglobin B cell antigen capable of activating mammaglobin-specific B cells. 
The B cell antigen comprises a mammaglobin-specific B cell epitope and a T H 
1 5 epitope, or determinant, recognized by T helper cells. 

In another embodiment, the mammaglobin antigen is a mammaglobin T c 
cell antigen recognized by mammaglobin-specific cytotoxic T lymphocytes which 
comprises a T c cell epitope and a binding site, or agretope, for a MHC class I 
molecule. 

20 Yet another embodiment of a composition according to the invention 

comprises B cell and T c cell antigens. 

Methods for treating a patient with a mammaglobin-expressing tumor 
include adoptive immunotherapy, which comprises ex vivo stimulation with a 
mammaglobin antigen of mammaglobin-specific lymphocytes isolated from the 

25 patient and subsequent administration of the activated lymphocytes to the patient, and 
in vivo stimulation of an anti-mammaglobin immune response, which comprises 
administering to the patient a vaccine comprising a mammaglobin antigen. 

Among the several advantages found to be achieved by the present - 
invention, therefore, may be noted the provision of a nucleotide sequence and < 

30 encoded amino acid sequence that can serve as markers for breast cancer cells; the 
provision of methods for early detection of the presence of breast neoplasia cells; the 
provision of means for detecting breast cancer that can complement mammography 
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and increase the predictive value; the provision of methods that can provide an 
estimation of prognosis; the provision of markers that will allow the targeting of 
therapy; and the provision of compositions for stimulating a cellular and humoral 
immune response against the tumor. 
5 Brief Description of the Drawing s 

Figure 1 illustrates the strategy used to isolate the foil length 
mammaglobin cDNA including the Rapid amplification of cDNA Ends (RACE) 
Polymerase Chain Reaction (PCR) technique and subsequent subcloning into vectors 
pGEM7Zand pCEV27. 
1 0 Figure 2 illustrates the human cDNA sequence of SEQ ID NO: 1 

(nucleotides numbered above) and the amino acid sequence of the encoded 
mammary-specific protein, mammaglobin (SEQ ID NO:2)(amino acids numbered 
below), the solid bar indicating the 403 bp fragment (SEQ ID NO: 5) isolated by the 
RACE PCR method and the open bar indicating the 206 bp DEST002 sequence (SEQ 
15 ID NO:6); 

Figure 3 illustrates the amino acid sequence of the mammary-specific 
protein^ mammaglobin (hMAM), (SEQ ID NO:2) compared to rat prostatic steroid 
binding protein subunit C3 (rPSC3)(SEQ ID NO:7) and human clara cell 10 kD 
protein (hCC10)(SEQ ID NO:8) with identities marked by bold letters and double 
20 lines and structurally similar amino acids marked by single lines; 

Figure 4 illustrates (A) the Northern blot analysis of hybridization of the 
human cDNA sequence encoding the mammary-specific protein, mammaglobin 
(hMAM), to mRNA expressed by tissues from breast neoplasia, normal breast and 
other adult tissues and (B) the analysis of RT/PCR amplified samples of tissues from 
25 breast neoplasia, normal breast and other adult tissues; 

Figure 5 illustrates the translation of the mammary-specific cDNA 
sequence in an in ritw rabbit reticulocyte lysate assay system; 

Figure 6 illustrates Northern blot ^tmdizatioii ^th tfte cDNA encoding 
mammaglobin showing detection of mRNA in tumor 2410, in tumors from three of 
30 eight other patients (shown in bold), and to a lesser extent, in normal breast tissue 
(shown in italics), and comparing in two cases (the four lanes on the right) 
mammaglobin mRNA expression in tumor tissue and patient matched normal tissue; 
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Figure 7 illustrates the Western blot analysis using polyclonal antibody to 
the mammaglobin C-terminus (SEQ ID NO: 14) of the conditioned medium (S) and 
cell lysate (C) from MDA-MB-415 breast tumor cells in the absence (-) and presence 
(+) of the immunizing peptide showing detection of the precursor and secreted forms 
5 of mammaglobin protein in the cell medium and cell lysate, respectively; 

Figure 8 illustrates the Western blot analysis using the anti-mammaglobin 
polyclonal antibody of the conditioned medium (S) and cell lysate (C) from MDA- 
MEM 15 breast tumor cells grown in the absence (-) and presence (+) of tunicamycin, 
which blocks glycosylation, showing the lack of detectable mammaglobin protein in 
1 0 the lysate or medium of cells in which N-linked glycosylation is inhibited; 

Figure 9 illustrates the Western blot analysis of cell lysates from human 
breast tumor cells showing detection of the precursor mammaglobin protein using the 
anti-mammaglobin polyclonal antibody and goat anti-rabbit antibody visualized by 
enzyme-linked chemiluminescence; 
15 Figure 10 illustrates the Western blot analysis using the anti-mammaglobin 

polyclonal antibody of fluid secretions from human breast during pregnancy and 
postpartum showing detection of the secreted mammaglobin protein in proliferating 
mammary gland; 

Figure 1 1 A illustrates in color a paraffin-fixed section of breast cancer 
20 cells from a patient specimen, immunohistochemically stained using the antU 
mammaglobin polyclonal antibody and goat anti-rabbit antibody tagged with 
horseradish peroxidase and DAB as substrate showing a brown staining of cells 
expressing the mammaglobin protein; and 

Figure 1 IB illustrates in black and white a paraffin-fixed section of breast 
25 cancer cell from a patient specimen immunohistochemically stained using the anti- 
mammaglobin polyclonal antibody and goat anti-rabbit antibody tagged with 
horseradish peroxidase and D AB as substrate wherein the brown staining of cells 
expressing the ma^aglobin protein is indicated. , ^; 
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Description of the Preferred Embodiments : 

One aspect of the present invention is based upon the identification and 
sequencing of the cDNA identified as SEQ ID NO:l which encodes a mammary- 
specific secretory protein, mammaglobin, identified by SEQ ID NO:2 (FIG 2). As 
5 described below, the full length mammaglobin cDNA was isolated starting from 

tumor cell mRNA that was reverse transcribed, amplified using the technique of PCR 
and subcloned into expression vectors. In addition, the protein, mammaglobin, 
encoded by the cDNA was identified and characterized. 

Using the anonymous sequence tag previously designated DEST002, it 
10 was demonstrated that the corresponding gene product, which was up until now 

unknown but herein identified as mammaglobin, is particularly abundant in the breast 
cancer tumor cell line MDA-MB-415. To isolate the full length mammaglobin 
cDNA, the mRNA was reverse transcribed from this cell line and cloned using the 
RACE PCR technique (Edwards et at Nucleic Acids Research 79:5227-32, 1991 
15 which is incorporated by reference). This technique is based upon the strategy of 
ligation of single-stranded oligodeoxyribonucleotide to the 3* end of single-stranded 
cDNA. The method by which the mammaglobin cDNA was isolated is represented 
schematically in FIG 1 ; 

The full length 503 bp cDNA sequence (SEQ ID NO:l) was deduced from 
20 the sequence information obtained from the 403 bp fragment (SEQ ID NO:5) (FIG 2) 
isolated by this technique along with sequence information previously obtained from 
the corresponding DEST sequence (DEST002, SEQ ID NO:6) (FIG 2) in our earlier 
study (Watson and Fleming, supra). Within the 503 bp cDNA is a 279 bp open 
reading frame which encodes a polypeptide of 93 amino acids and predicted 
25 molecular mass of 1 0.5kD (SEQ ID NO:2) (FIG 2). The initial methionine of this 
open reading frame is within a near-perfect Kozak consensus sequence (Kozak, Cell 
22:7-8, 1980 which is incorporated by reference). The 60 bp upstream of this 
sequence contain no oth^ in- 
untranslated sequence ofthecDNA constitutes 163 bp and contains a polyadenylation 
30 signal, AATAAA, 12 bp upstream of the priming site of the original DEST002 
sequence. These data indicate that the full length mammaglobin cDNA has been 
isolated. The first 19 residues of the encoded polypetide predict a hydrophobic 
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peptide signal sequence and residues 53-55 and.68-70 are consensus N-linked 
glycosylation sites, indicating that mammaglobin is a secreted glycoprotein. 

A search for DNA sequences similar to the mammaglobin cDNA sequence 
in Genbank using the BLAST algorithm (Benson et al., Nucl Acid Res21 :2963-2965, 
5 1993; Altschul et al, JMol Biol 215:403-4 10, 1990 which are incorporated by 

reference), identified no obvious DNA sequence homologies. Thus, mammaglobin 
cDNA is believed to be a novel, heretofore unknown DNA sequence. 

A search of other polypeptides for sequences related to mammaglobin 
revealed an amino acid sequence homology between mammaglobin and other 
10 polypeptides. Mammaglobin exhibited 42% amino acid identity (58% including 
conservative substitutions) with rat prostatic steroid binding protein (prostatein) 
subunit C3 (rPSC3) (FIG 3) (SEQ ID NO:7). Rat prostatic steroid binding protein is a 
major secretory protein in the rat ventral prostate consisting of a tetrameric protein 
composed of two different dimeric subunits; C3/C1 and C3/C2 (Parker et al., Ann N Y 

15 Acad Sci 438:115-124; Parker et al., J Steroid Biochem 20:67-71, 1984 which are 

incorporated by reference). The CI, C2, and C3 genes all encode approximately 6 kD 
secretory proteins and are thought to have arisen from gene duplication, but while the 
CI and C2 genes show strong homology to each other, they are much less similar to 
the C3 gene. Correspondingly, mammaglobin shows no sequence homology with the 

20 CI orC2 proteins. 

As noted above, prostatic steroid binding protein (prostatein) is the major 
secretory protein in the rat ventral prostate and its expression is regulated by 
androgenic steroids (Parker et al, AnnN Y Acad Sci 438: 1 15-24, 1984; Parker et al, J 
Steroid Biochem 20:67-7 1, 1984 which are incorporated by reference). Another 

25 protein, human estramustin-binding protein (hEMBP) has been reported to be 

expressed in human prostate, human breast cancer and human malignant melanoma. 
(Bjork et al, Cancer Res 42:1935-1942, 1982; Bjork et al, Anticancer Res 11:1 173-82, 
1991 which are incorporated by reference). ^, Huinan estramustin-binding protein is 
immunocheniically similar to rat estramustin-binding protein, which has been 

30 postulated to be identical to rat steroid-binding protein, prostatein. As noted above, 
the amino acid sequence of mammaglobin exhibited 42% amino acid identity and 
5tf% homology including conservative substitutions with the C3 subunit of prostatein. 
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Thus it is possible that mammaglobin could be in some way related to hEMBP. 
However, while both prostaieiii and hEMBP are detected in the prostate gland, 
maramaglobin mRNA is completely absent in this tissue. Hence, mammaglobin is 
neither the same protein nor a subunit of hEMBP and, furthermore, the sequence of 
5 hEMBP has not been determined so that it is not known whether there is even any 
similarity of mammaglobin with some fragment or subunit of hEMBP. 

Although recent reports have demonstrated the rPSC3 promoter fused to 
S V40 T antigen produces both prostatic and mammary carcinomas in transgenic mice 
(Marouldkou etal^Proc Nat Acad Sci US 97:11236-11240, 1994; Sandmolleret al, 
10 Oncogene 9:2805-281 5, 1994 which are incorporated by reference), the true" 
biological function of this protein is unknown, furthermore, notwithstanding the 
hypothesized relationship of rat prostatic steroid binding protein to human EMBP, no 
human polypeptide or human gene corresponding to rPSC3 has been identified. Thus, 
mammaglobin and the cDNA encoding mammaglobin represent novel sequences 
15 heretofore unknown. 

Using manual alignment with other sequences that had less significant 
BLAST scores with both mammaglobin and rPSC3 protein sequences, we identified 
other homologies with human clara cell lOkD protein (tiCCIO) (SEQ ID NO:8) (Peri 
et al, J Clin Invest 92:2099-2109, 1993 which is incorporated by reference) (FIG 3) 
20 and, in addition, with rabbit and mouse uteroglobin proteins (Miele et al., Endocrine 
Rev 8:474-90, 1987; Cato and Beato, Anticancer Res 5:65-72 ; 1985; Miele et al., J 
Endocrinol Invest 1 7:679-692, 1 994 which are incorporated by reference). These 
homologies, depending on species, were 26% identity or 40% including conservative 
substitutions. In particular, a number of amino acids were perfectly conserved among 
25 all proteins, including Cys-3 and Cys-69 which are known to play a role in disulfide 
bond formation between uteroglobin subunits (see below). These homologies suggest 
that mammaglobin is a novel member of a small family of proteins that are secreted 
by bpitheiia^ Q-^-n *hc ^ >:> ^ 

The hCCIO gene is the human homologue of rabbit and mouse uteroglobin 
30 genes (Peri et al, J Clin Invest r 92:2099-2109, 1993 which is incorporated by 

reference). Uteroglobin was originally characterized as a secretory protein in rabbit 
uterus, but has since been found in other epithelial organs including lung, breast and 
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prostate. Unlike rat prostatein, uteroglobin is a homodimeric protein coupled by two 
disulfide linkages at the conserved residues Cys-2 and Cys-69 (Miele et al, 1994, 
supra). Although uteroglobin gene transcription is regulated by steroid hormones, the 
ability of the protein itself to bind progesterone or other steroid hormones is 
5 controversial and again, the true biological function of this protein is unknown (Miele 
etal., 1994, supra). 

Mammaglobin expression is restricted to the mammary gland. This is in 
contrast to the observation that rPSC3 is expressed in rat ventral prostate (Parker et 
al., Ann N Y Acad Sci 438: 1 15-1 124, 1984), and the expression of hCClO/uteroglobin 
10 in numerous tissues including lung, uterus, prostate, and breast (Miele et al., 1987, 
supra\ Cato and Beato, supra; Miele et al, 1994 supra). Because of the sequence 
homology between mammaglobin and these proteins, we determined the pattern of 
tissue specific expression. 

The 500 bp mammaglobin mRNA was easily detected in tumor specimen 
15 24 1 0 (the tissue from which this original sequence tag was isolated) and to a much 
less extent in normal human breast tissue (FIG 4A). Mammaglobin mRNA could not 
be detected in the immortalized breast epithelial cell line B5-589. Expression of 
mammaglobin was also undetectable in human uterus and lung, two sites of 
uteroglobin expression. 
20 Amplification using RT/PCR detected mammaglobin mRNA in both 

tumor 2410 and normal breast tissue, but not in 15 other tissues surveyed, including 
tissues that normally express rPSC3 and uteroglobin (lung, uterus, prostate), 
hormonally responsive and steroidogenic tissues (ovary, testis, placenta), and other 
secretory epithelial organs (colon) (FIG 4B). Therefore, the expression of 
25 mammaglobin mRNA is relatively specific for mammary tissue. 

Based on the studies in this report, mammaglobin is a relatively 
mammary-specific protein. Two other genes known to be overexpressed in breast 
carcinoma are erb-B and cyclin D (Jardines et al, Pathobiology 57:268-282, 1994; 
Keyomars and Pardee, Proc Nat Acad Sci tt£ P0:1112-1116, 1993 whichis 
30 incorporated by reference). Unlike the overexpression of erb-B or cyclin D, the 
overexpression of mammaglobin may reflect a more specific alteration of the 
m^imary epithelial cell rather than a general increased growth potential or mitotic 
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rate. As such, appearance of mammaglobin gene dysregulation may have more 
specific import for the therapeutic vulnerability or clinical course of a tumor. 

Mammaglobin expression could not be detected in normal lymph nodes or 
peripheral lymphocytes at the level of sensitivity afforded by a single step RT/PCR 
5 assay. This suggests that analysis of mammaglobin transcripts in peripheral lymph 
nodes may be useful for detecting occult breast cancer metastases, as has been 
suggested for other epithelial specific genes (Schoenfeld et ah, Cancer Res 54:2986- 
90 which is incorporated by reference). 

To demonstrate that the mammaglobin cDNA encoded a translatable 
10 protein, the cDNA clone was used in an hi vitro translation assay. Figure 5 shows the 
protein product from a rabbit reticulocyte lysate programmed with the mammaglobin 
cDNA. An approximately 6 kD protein is generated using the mammaglobin cDNA. 
The apparent molecular weight is smaller than that predicted from conceptual 
translation of the open reading frame, but this finding is commonly observed with 
1 5 rabbit and human uteroglobin translation products as well. 

Although we detected overexpression of mammaglobin RNA in one tumor 
specimen (i.e. 2410), it was not clear at what frequency this overexpression is seen in 
other breast carcinomas. We therefore examined a panel of fifteen, stage I primary 
breast carcinomas of differing histological types by Northern blot hybridization with 

20 the mammaglobin cDNA probe. Because of potential variability in expression due to 
environment influences (e.g. patient hormonal status), we also sought to compare 
tumor specimens directly with patient-matched normal breast tissues samples, 
although this was not possible in many cases. As shown in FIG 6, the 500 bp 
mammaglobin mRNA was again detected in normal breast tissue and tumor 2410. 

25 Mammaglobin was also detected in three other tumors, two of which demonstrated 

little or no expression in patient-matched normal tissue (BOIS and B022). In all, 4 of 
15 (27%) of tumors examined overexpressed mammaglobin mRNA. These data 
suggest that overexpression of mamthaglobiri is not unique to a single tumor 
specimen and is, in fact, relatively frequent among primary breast tumors. 

30 Furthermore, the fact that all tumors examined were stage I suggests that this 
dysregulation occurs relatively early in the progression of breast neoplasia. 
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Because Applicants believe mammaglobin is a secreted protein, its 
presence would be expected to be detectable in sera from patients whose tumors 
overexpress this gene product. As such, mammaglobin is likely to be as clinically 
useful as prostate specific antigen (PSA) and other solid tumor markers for managing 
5 patients with breast cancer (Tumor markers in diagnostic pathology, Clin Lab Med 
10: 1-250, 1990 which is incorporated by reference). 

We determined the prevalence of mammaglobin as a tumor marker in the 
general population of breast cancer tumors by examining the expression of 
mammaglobin in several primary breast carcinomas. Although the number of 
10 specimens examined in this study was small, 27% of tumors evaluated overexpressed 
mammaglobin mRNA. This percentage is comparable to the prevalence of other 
genetic alterations such as erb-B amplification and p53 mutation (Slamon et al. Sci 
2^:707-712, 1 989; Thor et al, J Natl Cancer Inst 5*845-855, 1992 which are 
incorporated by reference). Furthermore, because we have restricted our analysis to 
1 5 stage I tumors, overexpression of mammaglobin would actually be more prevalent 
than any other genetic alteration reported in this subgroup of tumors (Alllerd et al, J 
NaH Cancer Inst 55:200-206, 1993 which is incorporated by reference). 

The identification of mammaglobin as a breast cancer marker provides the 
basis for another aspect of the present invention, which involves methods for 
20 detecting the presence of breast cancer in a patient 

The term "detection" as used herein in the context of detection of breast neoplastic 
disease is intended to be a comprising aspect of the determining of the presence of 
breast cancer in a patient, the distinguishing of breast cancer from other diseases, the 
estimation of prognosis in terms of probable outcome of the disease and prospect for 
25 recovery, the monitoring of the disease status or the recurrence of the disease, the 
determining of a preferred therapeutic regimen for the patient and the targeting of 
antitumor therapy. 

A method for detecting breast cancer comprises hybridizing a 
polynucleotide to mRNA from breast neoplasia cells. The polynucleotide comprises 
30 SEQ ID NO: 1 or a derivative of SEQ ID NO: 1 . A derivative of a nucleotide sequence 
means that the derived nucleotide sequence is substantially the same as the sequence 
frbjn which it is derived in that the derived nucleotide sequence has sufficient 
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sequence complementarity to the sequence from which it is derived to hybridize to 
mRNA from breast neoplasia cells under the same stringency conditions that the 
sequence from which it is derived hybridizes to the mRNA from breast neoplasia 
cells. The derived nucleotide sequence is not necessarily physically derived from the 
5 nucleotide sequence, but may be generated in any manner including for example, 
chemical synthesis or DNA replication or reverse transcription or transcription. 

To detect the presence of mRNA encoding mammaglobin in a detection 
system for breast cancen a sample is obtained from a patient. The sample can be a 
tissue biopsy sample or a sample of blood, plasma, serum or the like. The sample 
10 may be treated to extract the nucleic acids contained therein. The resulting nucleic 
acid from the sample is subjected to gel electrophoresis or other size separation 
techniques. 

Detection involves contacting the nucleic acids and in particular the 

mRNA of the sample with a DNA sequence serving as a probe to form hybrid 
15 duplexes. The term "probe" refers to a structure comprised of a polynucleotide which 

forms a hybrid structure with a target sequence, due to complementarity of probe 

sequence with a sequence in the target region. 

Detection of the resulting duplex is usually accomplished by the use of 

labeled probes. Alternatively, the probe may be unlabeled, but may be detectable by 
20 specific binding with a ligand which is labeled, either directly or indirectly. Suitable 

labels and methods for labeling probes and ligands are known in the art, and include, 

for example, radioactive labels which may be incorporated by known methods (e.g., 

nick translation or kinasing), biotin, fluorescent groups, chemiluminescent groups 

(e.g., dioxetanes, particularly triggered dioxetanes), enzymes, antibodies, and the like. 
25 When using the cDN A encoding mammaloglobin or a derivative thereof as 

a probe, high stringency conditions can be used in order to prevent false positives. 

When using sequences derived from mammaglobin, less stringent conditions can be 
" ■ used; The stringency of hybridization is determined by timber of factors during 

hybridization and during the washing procedure, including temperature, ionic 
30 strength, length of time and concentration of formamide. These factors are outlined 

ip, for example, Sambrook et al. {Molecular Cloning: A Laboratory Manual, 2d ed., 

1989). 
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In order to increase the sensitivity of detection of mammaglobin mRNA in 
a sample, the technique of reverse transcriptioa'polymerization chain reaction 
(RT/PCR) can be used to amplify cDNA transcribed from mRNA encoding 
mammaglobin. The method of RT/PCR is well known in the art (for example, see 
5 Watson and Fleming, supra). 

The RT/PCR method can be performed as follows. Total cellular RNA is 
isolated by, for example, the standard guanidium isothiocyanate method and the total 
RNA is reverse transcribed. The reverse transcription method involves synthesis of 
DNA on a template of RNA using a reverse transcriptase enzyme and a 3' end primer. 
1 0 Typically, the primer contains an oligo(dT) sequence. The cDNA thus produced is 
then amplified using the PCR method and mammaglobin specific primers. 
(Belyavsky et al, Nucl Acid Res 1 7:2919-2932, 1989; Krug and Berger, Methods in 
Enzymology, Academic Press, N.Y., Vol.152, pp. 316-325, 1987 which are 
incorporated by reference) 

15 The polymerase chain reaction method is performed using two 

oligonucleotide primers that are complementary to the two flanking regions of the 
DNA segment to be amplified. The upstream and down stream primers are typically 
from 20 to 30 base pairs in length and hybridize to the flanking regions for replication 
of the nucleotide sequence. The primers are selected to be substantially 

20 complementary to the strand of cDNA being amplified. Therefore, the primers need 
not reflect the exact sequence of the template, but must be sufficiently complementary 
to selectively hybridize with the strand being amplified. 

Polymerization from the primers is catalyzed by a DNA-polymerase in the 
presence of deoxynucleptide triphosphates or nucleotide analogs to produce double- 

25 stranded DNA molecules. The double strands are then separated by any denaturing 
method including physical, chemical or enzymatic. Commonly, the method of 
physical denaturation is used involving heating the nucleic acid, typically to 
temperatures from about 80°C to 105°C for times ranging from about 1 to 10 minutes. 
The process is repeated for the desired number of cycles. 

30 Following amplification, the PCR product is then detected by ethidium 

bromide staining (Sambrook, et al., 1989, supnr). 
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In another embodiment of the present invention, the mammaglobin cDNA 
sequence or derivative thereof can be used to characterize any alteration of the 
mammaglobin gene (i.e. gene rearrangement, gene amplification, or gene deletion) in 
a specimen from a breast-cancer patient. This provides a method whereby patient 
5 specimens or samples, which do not contain intact mRNA, can still be examined for 
changes in gene structure. 

In one application of this technique, the mammaglobin cDNA sequence or 
derivative thereof is hybridized to patient genomic DNA that has been isolated from a 
patient's tumor, normal tissue, or lymphocytes and digested with one or more 
10 restriction endonucleases. Using the Southern blot protocol, which is well known in 
the art, this assay determines whether a patient or a patient's breast tumor has a 
mammaglobin gene, which was deleted, rearranged, or amplified. Detection of these 
changes can then provide important information useful for predicting prognosis and 
for patient management. 
15 In a second application of this technique, one or more pairs of 

oligonucleotide primers based on the mammaglobin cDNA sequence or derivative 
thereof could be used in the polymerase chain reaction to amplify segments of the 
mammaglobin gene from a patient sample. Analysis of the resulting PCR products 
indicate whether a particular segment of the mammaglobin gene is deleted or 
20 rearranged. Such information is useful for prognosis and patient management. 

Another method for detecting breast cancer comprises detecting the 
presence of the precursor and/or secreted forms of the mammaglobin polypeptide in a 
sample obtained from a patient. Ahy method known in the art for detecting proteins 
can be used. Such methods include, but are not limited to immunodiffusion, 
25 immunoelectrophoresis, immunochemical methods, binder-ligand assays, 

immunohistochemical techniques, agglutination and complement assays, (for 
example see Basic and Clinical Immunology, Sites and Terr, eds„ Appleton & Lange, 
Norwalk, Coim. pp 217^2^ is incorporated by reference)! Preferred are 

binder-ligand immunoassay methods including reacting antibodies with an epitope or 
30 epitopes of mammaglobin and competitively displacing a labeled mammaglobin 
polypeptide or derivative thereof. 
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As used herein, the term "mammaglobin polypeptide" embraces naturally 
occurring mammaglobin, including nonglycosylated and glycosylated precursor forms 
and the glycosylated secreted form, derivatives and fragments thereof. By naturally 
occurring is meant a polypeptide that can be isolated from a source in nature, e.g., 
5 from normal and/or diseased organisms, and that has not been intentionally modified 
by man. 

A derivative of mammaglobin is intended to refer to polypeptides which 
are comprised of a segment of at least 10 amino acids that has substantial identity to a 
portion of naturally occurring mammaglobin. The segment having substantial identity 
10 is preferably at least about 20 amino acids, more preferably at least about 50 amino 
acids, and most preferably at least about 75 amino acids. Two polypeptides have 
substantial identity when upon optimal alignment by sequence alignment programs 
such as BLAST, they share at least 80 percent sequence identity, preferably at least 95 
percent sequence identity, more preferably at least 95 percent sequence identity, most 

15 preferably 99 percent sequence identity. Preferably, residue positions which are not 
identical differ by conservative amino acid substitutions. 

Conservative amino acid substitutions refer to the interchangeability of 
residues having similar side chains. For example, a group of amino acids having 
aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of 

20 amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of 
amino acids having amide-containing side chains is asparagine and glutamine; a 
group of amino acids having aromatic side chains is phenylalanine, tyrosine, and 
tryptophan; a group of amino acids having basic side chains is lysine, arginine, and 
histidine; and a group of amino acids having sulfur-containing side chains is cysteine 

25 and methionine. Preferred conservative amino acid substitution groups are: valine- 
leucine-isoleucine, phenylalanine-tyrosine, lysine-arginirie, alanine-valine, and 
asparagine-glutamine. 

A derivative mammaglobin polypeptide: will preferably cross-react with an 
anti-mammaglobin antibody, 

30 monoclonal or polyclonal, which is specific for naturally occurring mammaglobin or 
fragments thereof. 
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As used herein the terms "fragment" and "peptide" refer to a 
mammaglobin polypeptide having an amino acid sequence identical to the amino acid 
sequence deduced from a full-length mammaglobin cDNA (e.g., SEQ ID NO:l) or 
derivative thereof, but that has an amino-terminal and/or carboxy-terminal deletion. 
5 Typically, mammaglobin fragments or peptides are at least 3 amino acids long. 
Preferably a mammaglobin fragment or peptide is at least 6 amino acid residues in 
length, more preferably about 12 amino acid residues in length, even more preferably 
about 25 amino acid residues in length, and most preferably 50 amino acid residues or 
greater. 

1 0 Numerous competitive and non-competitive protein binding 

immunoassays are well known in the art. Antibodies employed in such assays may be 
unlabeled, for example as used in agglutination tests, or labeled for use a wide variety 
of assay methods. Labels that can be used include radionuclides, enzymes, 
fluorescers, chemiluminescers, enzyme substrates or co-factors, enzyme inhibitors, 

15 particles, dyes and the like for use in radioimmunoassay (RIA), enzyme 

immunoassays, e.g., enzyme-linked immunosorbent assay (ELISA), fluorescent 
immunoassays and the like. 

Polyclonal or monoclonal antibodies to a mammaglobin polypeptide 
comprising a B cell epitope can be made for use in immunoassays by any of a number 

20 of methods known in the art. As used herein, the term "B cell epitope" refers to an 
antigenic determinant of a mammaglobin polypeptide. A B cell epitope could 
comprise 3 amino acids in a spacial conformation which is unique to the epitope. 
Generally, a B cell epitope consists of at least 5 such amino acids. Methods of 
determining the spatial conformation of amino acids are known in the art, 
25 and include, for example, x-ray crystallography and 2 dimensional nuclear magnetic 
resonance. " 5 ; 

One approach for preparing antibodies to a protein is the selection and 
i ^-preparatidii of ah amiiib acid sequence <jf all 6r piart^the ^Sdiemically 

synthesizing the sequence and injecting it into ah appropriate animal, usually a rabbit 
30 or a mouse. 

^ Methods for preparation of a mammaglobin polypeptide include, but are 

ndUimited to chemical synthesis, recombinant DNA techniques or isolation from 
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biological samples. Chemical synthesis of a peptide comprising an epitope can be 
performed, for example, by the classical Merrifeld method of solid phase peptide 
synthesis (Merrifeld, J Am Chem Soc 55:2149, 1963 which is incorporated by 
reference) or the FMOC strategy on a Rapid Automated Multiple Peptide Synthesis 
5 system (DuPont Company, Wilmington, DE) (Caprino and Han, J Org Chem 
57:3404, 1972 which is incorporated by reference). 

Polyclonal antibodies can be prepared by immunizing rabbits by injecting 
antigen into the popliteal lymph nodes followed by subsequent boosts at two week 
intervals with intraperitoneal injection of antigen. The animals are bled and sera 
10 assayed against purified mammaglobin protein, usually by ELISA. 

Monoclonal antibodies can be prepared after the method of Milstein and * 
Kohler by fusing splenocytes from immunized mice with continuously replicating 
tumor cells such as myeloma or lymphoma cells. (Milstein and Kohler Nature 
256:495-497, 1975; Gulfre and Milstein, Methods in Enzymology: Immunochemical 
1 5 Techniques 73: 1 -46, Langone and Banatis eds., Academic Press, 1 98 1 which are 
incorporated by reference). The hybridoma cells, so formed are then cloned by 
limiting dilution methods and supernates assayed for antibody production by ELISA 
orRIA. 

Thus prepared polyclonal or monoclonal antibodies to mammaglobin may 
20 be used to isolate and purify precursor and secreted forms of mammaglobin from cells 
expressing mammaglobin. For example, as shown below, a polyclonal antibody 
generated against the 16 C-terminal amino acids predicted from mammaglobin cDNA 
(Glu-Val-Phe-Met-Gln-Leu-Ile-Tyr-^ 

NO: 14) binds to precursor and secreted forms of mammaglobin, as well as to 
25 mammaglobin that has been synthesized in an in vitro translation system. Isolation of 
mammaglobin using an anti-mammaglobin antibody may be accomplished using 
procedures well-known in the art,.such as affmity chromatography. 

The unique ability of antibodies tarerognize and ^pecifically bind to target 
antigens expressed by a tumor cell provides an approach for the treatment of cancer. 
30 (For review see LoBuglio and Szlch 9 AmJMed Sci 304:2 14-224, 1992; Bagshawe, 
Adv Pharmacol 24:99-121, 1993 which are incoiporated by reference). Thus, another 
asfcpct of the present invention provides for a method for preventing the onset and 
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treating breast cancer in an animal based upon the use of antibodies to mammaglobin, 
which has been discovered to be overexpressed by breast cancer cells. 

Specific antibodies to mammaglobin, either polyclonal or monoclonal, are 
produced by any method known in the art. For example, murine or human 
5 monoclonal antibodies can be produced by hybridoma technology. Alternatively, 
mammaglobin, or an immunologically active derivative or fragment thereof, or an 
anti-idiotypic antibody, or fragment thereof, can be administered to an animal to elicit 
B cell production of antibodies capable of recognizing the mammaglobin-expressing 
cells. 

1 0 The antibodies so produced or fragments thereof are labeled with one or 

more oncolytic substances such as radionuclides, toxins, or cytotoxic drugs and 
administered to a patient suspected of having breast cancer. The binding of the 
labeled antibody to the mammaglobin being overexpressed by the breast cancer cell 
will cause the death of the cancer cell. 
1 5 Any of a variety of oncolytic substances known in the art can be used to 

produce such labeled antibodies. For example, immunotoxins can be made by 
coupling plant and bacterial toxins to antibodies. Such toxins include, for example, 
ricin, diphtheria toxin and Pseudomonas exotoxin A. Drug-antibody conjugates can 
also be made in which chemotherapeutic agents are linked to the antibody. 
20 Chemotherapeutic agents suitable for such use include, for example, tomoxifen, 
doxorubicin, methotrexate, chlorambucil, Vinca alkaloids, and mitomycin. In 
addition, radioimmunoconjugates can be made in which a radionuclide is stably 
linked to the antibody. Radionuclides suitable for making radioimmunoconjugates 
include, for example, B-emmitters such as ,3, I, l88 Re, ,86 Re, 67 Cu, *°Y and 47 Sc; a- 
25 emitters such as 2,1 At, 2,2 Bi and 2,2 Pb; auger electron emitters such as l25 I and 77 Br, and 
fissionable nuclides such as ,0 B. 

The finding that a significant percentage of breast tumors express 
mamniaglbbin protein is the basis for another aspect of the invention, which involves 
the activation of mammaglobin-specific B and/or T cell lymphocytes (T c ) with 
30 mammaglobin antigens. Accordingly, the invention provides mammaglobin B cell 
antigens and T c cell antigens; vaccines comprising at least one B cell mammaglobin 
antigen and/or at least one T c mammaglobin antigen for inducing antibody- and/or 
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cell-mediated immune responses against mammaglobin-expressing tumors, and 
methods for treating a breast cancer patient with a mammaglobin-expressing tumor. 
One method according to the invention comprises administering to the patient 
activated mammaglobin-specific lymphocytes. Another method comprises 
5 administering to the patient a mammaglobin-specific vaccine. 

As used herein, "mammaglobin antigen" includes naturally occurring 
mammaglobin polypeptides, derivatives, and fragments thereof which contain a B cell 
or T c cell epitope recognized by mammaglobin-specific B cells or T c cells. 

A mammaglobin B cell antigen comprises a mammaglobin-specific B cell 
10 epitope and a T H cell epitope. The term "B-celi epitope" refers to any antigen, hapten, 
epitope or antigenic determinant which is recognized by anti-mammaglobin 
immunoglobulin receptors on B cells and is capable of eliciting the production of 
antibodies with appropriate help from T„ cells when administered to an animal. The 
B cell epitope comprises an amino acid sequence of at least 4 amino acids. 
15 Preferably, the B cell epitope is between at least 6 and 25 amino acids in length and 
more preferably is between about 15 and 22 amino acids in length. The comprising 
amino acid sequence of the B cell epitope may be identical or substantially identical 
to a Continuous amino acid sequence in a fragment of naturally occurring 
mammaglobin. Alternatively, the comprising amino acid sequence of a B cell epitope 
20 is identical to or substantially identical to a discontinuous amino acid sequence 
representing an assembled topographic determinant of mammaglobin. 

The term "T„ cell epitope" refers to any antigenic determinant recognized 
by T helper cells through association with MHC class II molecules. The activation of 
T helper cells induces differentiation of resting mammaglobin-specific B cells into 
25 higher affinity IgG-secreting cells, i.e, induces a secondary antibody response. The 
preparation and use of immunogenic peptides containing B and T H cell determinants 
to produce higher titres of specific antibody-producing B cells through T cell help is 
known in the ari;fe e t la., U.S. Patent No. 5,573,916, D^tpn, et la., 

Cancer Letters 70:143-150 (1993), Bonas-Cuesta et ah, Eur. J. Immunol 17, 1213- 
30 1215 (1987), and Good et aL, Science 2JJ:1059-1062 (1987), each of which is 

incorporated herein by reference, the T„ cell epitope may comprise an amino acid 
seqii^nce from mammaglobin or a heterologous protein. For example, Denton et al. 
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describe the induction of antibody responses to mucins, which are complex 
glycoproteins expressed in secretory epithelia and associated with breast and other 
carcinomas, in mice immunized with a synthetic peptide containing a B cell 
determinant region from the core of MUC-1 mucin linked to sequence 1 1 1-120 of 
5 influenza haemagglutinin A/X-3 1, a known helper T cell-determinant. The T H cell 
epitope comprises an amino acid sequence of between about 6 to about 20 amino acid 
residues, preferably between about 8 residues and 18 residues, even more preferably 
between 9 residues and 15 residues. 

A mammaglobin T c cell antigen comprises a T c cell epitope and a MHC 
10 class I agretope. The term "T c cell epitope" means any antigen, epitope or antigenic 
determinant which is recognized by mammaglobin-specific T c cells when presented 
by a MHC class I molecule on the surface of an antigen presenting cell. The term 
"MHC class I agretope" refers to any amino acid sequence recognized by a MHC 
class 1 molecule that allows the mammaglobin antigen to be presented to a 
1 5 mammaglobin-specific T c cell by the MHC class I molecule on an antigen presenting 
cell (APC). The T c cell epitope and MHC class I agretope are contained within an 
amino acid sequence of between about 6 to about 11 amino acids that is identical or 
substantially identical to the amino acid sequence of a fragment of naturally occurring 
mammaglobin. Preferably, the sequence is 8 or 9 amino acids in length. 
20 Methods for identifying B and T c cell epitopes for a protein antigen are 

known in the art. For example, the capacity of isolated mammaglobin-specific B cells 
or mammaglobin-specific T c cells to respond to overlapping synthetic peptides 
spanning secreted mammaglobin may be determined using standard immunobiology 
techniques. Those peptides identified as antigenic may then be modified one or a few 
25 amino acids at a time to optimize their ability to stimulate mammaglobin-specific B 
or T cells. 

B cell epitopes can also be mapped using commercially available epitope 
r mappiiig kits which involve the scl^nmg of ^dom ^jSsb^ 
to polyethylene multipin supports, e.g., Cambridge Research Biochemicals. 
30 Alternatively, the predicted mammaglobin amino acid sequence may be 

searched for sequences that conform to known binding motifs of MHC class I or 
MHC class II molecules. See e.g, Hill et al., Nature 360:434 (1992), Pamer et aL, 
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Nature 360:ZS2 (1992) and Hammer et al., 1 Exp. Med. 176:1007 (1992), and Falk et 
al, Nature JJ/:290-296 (1991), each of which is herein incorporated by reference. 
For example, antigenic peptides recognizable by breast tumor-specific CTLs may be 
identified by searching the mammaglobin amino acid sequence for HLA-A2-binding 
5 peptides as described by Peoples et al., Proc. Natl Acad. ScL 92:432-436 (1995), 
which is incorporated herein by reference. The choice of HLA-A2 as the antigen 
presenting molecule is appropriate where the patient expresses HLA-A2 
(approximately 50% of Caucasians express HLA-A2). The predicted HLA-A2 
binding peptides can be synthesized and tested for antigenicity by loading the 

10 synthetic peptides onto the T2 cell line, a human T-cell/B-cell fusion product 
containing a defect in antigen presentation such that HLA-A2 molecules on the 
surface of T2 cells can be effectively loaded with exogenous HLA-A2 binding 
peptides (Henderson, et al, Science 255:1264-1266 (1992) incorporated herein by 
reference). A standard cytotoxicity assay is then carried out which comprises 

15 incubating the peptide-kxided T2 cells with breast-specific CTLs derived from tumor 
infiltrating lymphocytes (TILs) isolated from a mammaglobin-expressing breast 
tumor, e.g., see Peoples et al., pages 432-433 and Toso et al., Cancer Research J5:16- 
20 (1996), herein incorporated by reference. 

Antigenic mammaglobin peptides containing T c cell epitopes may also be 

20 identified by acid-eluting endogenous peptides presented by HLA class I molecules 
on the tumor cell surface. (See, e.g.,Peoples et al., supra, p. 433). The eluted peptides 
may be separated by any number of techniques known in the art, including HPLC 
fractionation. The different peptide fractions are loaded onto T2 cells and the loaded 
T2 cells are incubated with breast-tumor specific CTLs to determine which peptides 

25 are recognized by the CTLs using standard immunobiology techniques. 

One use of a mammaglobin antigen according to the invention is in 
adoptive immunotherapy. This therapy involves in vitro activation and expansion by 
■r'^i a mammaglobin antigen of anti-mammaglobin antibody-producing B cells. and/or 

mammaglobin-specific cytotoxic T lymphocytes (CTLs) isolated from a patient with a 

30 mammaglobin-expressing tumor. The method may also be practiced with a 
composition comprising both mammaglobin B cell and T c cell antigens. The 
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activated lymphocytes are then introduced back into the patient for adoptive 
immunotherapy. 

A mammaglobin antigen according to the invention is also useful as a 
component of a mammaglobin-specific vaccine. The vaccine comprises an 
5 immunogenically-stimulatory amount of a mammaglobin antigen. As used herein, an 
immunostimulatory amount refers to that amount of antigen that is able to stimulate 
the desired immune response in the recipient for the amelioration, or treatment, of 
breast cancer. This amount may be determined empirically by standard procedures, 
well known to those of ordinary skill in the art, without undue experimentation. 

1 0 The antigen may be provided in any one of a number of vaccine 

formulations which are designed to induce the desired type of immune response, e.g., 
antibody and/or cell mediated. Such formulations are known in the art. See, e.g., A. 
Lanzavecchia, Science 260:937-944 ( 1 993) and U.S. Patent No. 5,585, 1 03 to 
Raychandhuri, each of which is incorporated herein by reference. Examples of 

1 5 vaccine formulations used to stimulate immune responses include pharmaceutically 
acceptable adjuvants such as aluminum salts; emulsions of squalene or squalane and 
muramyl dipeptide; liposomes; and a composition comprising a stabilizing detergent, 
a micelle-forming agent, and a biodegradable and biocompatible oil (Raychandhuri, 
supra). 

20 A mammaglobin-specific vaccine may also comprise a carrier cell loaded 

with a mammaglobin antigen. Preferably, the carrier cell is prepared from autolqgous 
professional antigen presenting cells (APC) such as macrophages, dendritic cells, or 
activated B or T lymphocytes. See e.g., Lanzavecchia, supra, p. 937. Professional 
APCs express a ligand, B7, that binds to CD28 or CTLA4 on T cells to deliver an 

25 antigen-nonspecific costimiilatory signal known as Signal 2 which prevents T cell 
anergy or inactivation. Thus, the vaccine may also comprise interieukin-2 or another 
costimulatory signal to counteract anergy induction. (Lanzavecchia, supra, p. 938.) 
■■■■■ Another formulation of a mammaglobin-specific vaccine comprises a 

recombinant vector containing a nucleotide sequence encoding for expression a 

30 mammaglobin antigen. The use of infectious agents to stimulate cytotoxic T 

lymphocytes is known in the art. (Raychaudhuri, supra.) Chimeric vectors have been 
described using vaccinia, polio, adeno-and retro-viruses, as well as bacteria such as 
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Listeria and BCG. For example, a canarypox virus vector, ALVAC has been shown 
to elicit strong cellular immune responses against encoded heterologous gene 
products (Taylor et al, Virology 757:321-328 (1991), incorporated herein by 
reference). In addition, a recombinant ALVAC expressing the MZ2-E human 
5 melanoma rejection antigen encoded by the MAGE-1 gene is able to stimulate in vitro 
MAGE-1 CTL activities in a TIL population derived from a breast tumor expressing 
MAGE-1 mRNA (Toso et al, supra). In another approach described in U.S. Patent 
No. 5593972 to Weiner et al (herein incorporated by reference), a recombinant 
expression vector encoding an antigen of an immunogenic protein to be targeted is 
10 directly administered to an individual either in vivo, e.g., to muscle cells, or to the 

cells of an individual ex vivo along with an agent that facilitates uptake of the DNA * 
into the cells. 

Those skilled in the art may readily determine how to formulate a vaccine 
suitable for achieving the desired immune response. For example, for inducing in 

1 5 vivo production of anti-mammaglobin antibodies, a mammaglobin-specific vaccine 
comprises at least one mammaglobin B cell antigen comprising a B cell epitope and a 
T H cell epitope. The T H cell epitope is preferably matched with the appropriate MHC 
class II haplotype of the intended vaccine recipient. Alternatively, a T H cell epitope 
could be used that is known to be recognized universally by humans regardless of 

20 HLA type such as the "universal" T cell epitope from tetanus toxoid (Panina- 

Bordignon et al. Eur. 7. Immunol 19:2237 (] 989), herein incorporated by reference). 
Preferably, the vaccine comprises a plurality of mammaglobin B cell antigens with T H 
epitopes recognized by MHC Class II molecules of different HLA types. 

Another embodiment of a mammaglobin-specific vaccine induces a cell- 

25 mediated response and comprises at least one mammaglobin T c antigen capable of 
activating mammaglobin-specific T c cells. Preferably, the vaccine comprises several 
T c cell iantigeris. 

* A mammaglobin-specific vaccine may also be formulated to induce both 

antibody and cell-mediated responses. This embodiment comprises both 
30 mammaglobin B cell and T c cell antigens. 

A patient with a mammaglobin-expressing tumor may be treated by 
administering to the patient an immunostimulatory amount of a mammaglobin- 
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specific vaccine according to the present invention. Administration of the vaccine 
may be by any known or standard technique. These include, but are not limited to 
intravenous, intraperitoneal, intramuscular, subcutaneous, or intramammary injection. 
Preferred embodiments of the invention are described in the following 
5 examples. Other embodiments within the scope of the claims herein will be apparent 
to one skilled in the art from consideration of the specification or practice of the 
invention as disclosed herein. It is intended that the specification, together with the 
examples, be considered exemplary only, with the scope and spirit of the invention 
being indicated by the claims which follow the examples. 
10 In the examples below, cell lines were obtained from American Type 

Culture Collection and grown in Dulbecco's minimal essential medium supplemented 
with 10% fetal calf serum. Tissue biopsy specimens were obtained from the Human 
Cooperative Tissue Network (LiVolsi et al. Cancer 77:1391-1394, 1993 which is 
incorporated by reference). 

15 

Example 1 

This example illustrates the isolation of mammaglobin cDNA. 
Total cellular RNA from the cell line MDA-MB41 5 was isolated using the 
standard guanidinium isothiocyanate method. (Belyavsky et al, supra). This RNA 
20 was used in the RACE PCR procedure employing the Amplifinder kit (Clonetech) 
and following the manufacturer's protocol. 

The synthesis of first strand cDNA was performed in a standard reaction 
containing 1 \ig RNA, 10 \xM specific mammaglobin primer D2R (5'-ATA AGA 
AAG AGA AGG TGT GG-3')(SEQ ID NO:4), 4]xl of 5X RT buffer (250 mM TrisCI 
25 pH8.3,375mMKci, 1 5inM MgCl 2 ), 2 \xl of lOOmM DTT, 1 jilof lOmMdNTPsand 
200 units of Superscript™ II reverse transcriptase (Gibco/BRL) in a reaction volume 
20 \xl The reaction proceeded for 1 hour at 45°C and was terminated by incubating at 
^ - 95X for 5 nunutes. R&A was hydrolyzed with 400 jiM NaOH at 65°C for 30 
minutes and neutralized with 400 acetic acid. 
30 The reaction mixture was then added to 3 volumes of 6M Nal and 10 §xl of 

treated glass beads. Beads were washed three times with 80% EtOH and nucleic acid 
was eluted from the beads in 45 jil of water. Nucleic acid was then precipitated and 
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resuspended in 10 nl of water. The purified first strand cDNA was ligated to the 
manufacturer's provided anchor oligonucleotide (SEQ ID NO:9, 5'-CAC GAA TTC 
ACT ATC GAT TCT GGA ACC TTC AGA GG-3'), using T4 RNA ligase at 27° for 
20 hours. One tenth of a ligation reaction was used for PCR amplification in a 50 jil 
5 reaction containing 1 manufacturer's anchor primer (SEQ ID NO: 10, 5'-CTG 
GTT CGG CCC ACC TCT GAA GGT TCC AGA ATC GAT AG-3'), 1 nM 
mammaglobin specific primer D2Rb (SEQ ID NO:l I, 5'-AAT CCG TAG TTG GTT 
TCT CAC C-3'), 200 nM dNTPs, 5 units of Vent™ DNA polymerase, and IX 
polymerase buffer ( lOmM Kcl, 20 mM TrisCl, 10 mM (NH 4 ) 2 S0 4 , 2 mM MgS0 4 , 

10 0. 1 % Triton X- 1 00). The reaction was incubated at 94° for 2 minutes and then 94° 
for 45 seconds, 50°- for 1 minute, and 72° for 90 seconds for a total of 40 times. 

The two downstream mammaglobin-specific nested oligonucleotides were 
D2R (SEQ ID NO:4) and D2Rb (SEQ ID NO: 1 1 ). An upstream mammaglobin- 
specific control oligonucleotide was also used as per the manufacturer's 

15 recommendations, D2F (5'-CTT TCT GCA AGA CCT TTG GC-3') (SEQ ID NO: 12). 
All PCR amplifications were performed with Vent DNA polymerase (New England 
Biolabs). The amplified RACE product was digested with EcoRI and ligated into the 
EcoRI and Smal sites of the plasmid vector pGEM7Z (Promega, Madison, WI). 

All sequencing was performed using the Taq DNA polymerase thermal 

20 cycle sequencing kit as per the manufacture's protocol (Promega) Briefly the 
procedure used is as follows. 

Ten pmol of sequence specific oligonucleotide was end labeled with 10 
pmol of 32 P-y ATP (3,000 Ci/mmol and 10 mCi/ml) using T4 polynucleotide kinase in 
a 10 \xl reaction for 30 minutes at 37°C. A polymerization reaction containing 100 ng 

25 of plasmid template, 1 .5 pmol of labeled sequencing primer, and 5 units of 

sequencing grade Taq polymerase was created in 17 |al of the manufacturer's provided 
sequencing buffer. This reaction was aliquoted to a set of four reaction tubes 
r containing manufacturer's provided mix of deoxynucleotides and either dideoxy-A, C, 
G, or T. The set of four tubes were incubated at 95°C for 2 minutes and then, 94°C 

30 for 45 seconds, 45°C for 30 seconds, and 72°C for 1 minute for 30 times. After 

reactions were completed, 3 \i\ of 80% formamide/bromphenol blue dye was added to 
edqh tube. Samples were heated to 70°C for 2 minutes and loaded on a 6% 
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acrylamide/7.5 M urea sequencing gel and run for 2-4 hours and 60 W constant 
power The gel was dried and then exposed to Kodak XAR5 Xray film for 2 to 24 
hours. 

The sequence thus obtained was a 403 bp fragment (SEQ ID NO:5) as 
5 shown in FIG 2, solid bar. In earlier work the DEST002 Tag sequence was isolated 
(Watson and Fleming, supra). This sequence was a 206 bp fragment (SEQ ID NO:6) 
as shown in FIG 2, open bar. Combining the information from these two sequences 
allowed the full-length 503 bp cDNA of mammaglobin to be deduced. (FIG 2). 

10 Example 2 

This example demonstrates that mammaglobin expression is restricted to 
mammary gland tumor cells and, to a lesser extent, normal mammary gland cells. 

Total cellular RNA samples were isolated using the standard guanidinium 
isothiocyanate method and treated with RNase-free DNase (Promega). For RT/PCR 
15 analysis, 1 |ig of indicated total RNA was reverse transcribed with oligo dT 2l (SEQ 
ID NO: 1 3) and Superscript II reverse transcriptase (Gibco/BRL) according to the 
manufacture's protocol. 

Two hundred ng of oligo dT 21 (SEQ ID NO: 1 3) and 1 ^g of total RNA 
were incubated at 65 °C for 5 minutes in a 10 ^1 volume. Sample was chilled on ice 
20 and added to it were 4\x\ of 5X RT buffer (250 mM TrisCl pH8.3, 375 mM Kcl, 1 5 
mM MgCU), 2 nl of 1 OOmM DTT, 1 \x\ of 1 OmM dNTPs and 200 units of 
Superscript™ II reverse transcriptase (Gibco/BRL). the reaction proceeded for 1 
hour at 45°C and was terminated by incubating at 95°C for 5 minutes. 

One tenth of each RT reaction was subject to PCR analysis using the 
25 mammaglobin specific primers D2R (5-ATA AGA AAG AGA AGG TGT GG-3 1 ) 
(SEQ ID NO:4) and d2i02 J (5'-CAG CGGCft CCT TGA TCC TTG-3') (SEQ ID 
NO:3) and standard reaction conditions for 40 cycles at 94° x 30 sec./55° x 1 min772° 
^ ^ x 1 min. • • 1 --■ : ■ "*" r: ' 

For Northern analysis, 20 ^ig of total RNA was analyzed as previously 
30 described (Watson and Fleming, supra) using the full length mammaglobin cDNA 
prc>be. Integrity and equal loading of each RNA sample was assessed by ethidium 
bromide staining. 
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As shown in FIG 4A, the 500 bp mammaglobin message is easily detected 
in tumor specimen 24 10 (the tissue from which this original DEST was isolated) and 
to a much lesser extent in normal human breast tissue but not in the immortalized 
breast epithelial cell line B5-589, or in human lung, placenta, uterus and ovary (FIG 
5 4A). Following amplification using RT/PCR analysis, mammaglobin expression was 
still not detected in 15 tissues surveyed (FIG 4B). Detection of glyceraldehyde 3- 
phosphate dehydrogenase (GAPDH) message (FIG 4B) and EGF receptor message 
(data not shown) in these reactions demonstrated that absence of expression was not 
due to degraded RNA or other trivial explanations. Thus the expression of 
10 mammaglobin mRNA is relatively specific for mammary tissue. 

Example 3 

This example demonstrates that the mammaglobin cDNA encodes a 
translatable nucleotide sequence which results in protein product of appropriately 
1 5 predicted molecular mass. In vitro translations were performed using the TNT™ 
rabbit reticulocyte translation kit with T7 RNA polymerase (Promega) and 35 S- 
Methionine (>1000 Ci/mmol; 10 mCi/ml, Amersham) according to the manufacturer's 
protocol. 

To 25 ^1 of TNT™ rabbit reticulocyte lystae was added 2 ^il of 
20 manufacturer's prepared reaction buffer, T7 RNA polymerase, 20^M amino acid 

mixture minus methionine, 40^Ci 35 S-methionine (1,000 Ci/mmol and 10 mCi/ml), 40 
units ribonuclease inhibitor, 1 fig of mammaglobin/pGEM7 plasmid, and sufficient 
DEPC treated water to create a final reaction volume of 50 jil. This reaction was 
incubated at 30°C for 60 minutes. 5fal of this reaction was removed into 20nl of SDS 
25 gel buffer, boiled for 2 minutes, and loaded on a 1 7.5% SDS-polyacrylamide gel. 

Rabbit reticulocyte lysate programmed with mammaglobin cDNA 
produced a 6kD protein while that programmed with no cDNA did not produce any 
protein product. 
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Example 4 

This example illustrates the prevalence of overexpression of 
mammaglobin in primary breast carcinoma. 

To determine the frequency of mammaglobin overexpression in breast 
5 carcinomas, we examined a panel of fifteen, stage I primary breast carcinomas of 

i.' 

differing histological types using Northern blot hybridization with the mammaglobin 
cDNA probe. Patient-matched normal breast tissues samples were also compared in 
tissues from two patients (FIG 6). The 500 bp mammaglobin mRNA was detected in 
normal breast tissue and tumor 2410 and in three other tumors, two of which when 

10 tested demonstrated little or no expression in patient-matched normal tissue (BO!5 v. 
B016; B022 v. B023) (FIG 6). In all, 4 of 15 (27%) of tumors examined 
overexpressed mammaglobin mRNA. 

These data indicate that overexpression of mammaglobin is not unique to a 
single tumor specimen and is, in fact, relatively frequent among primary breast 

15 tumors. Furthermore, the fact that ail tumors examined were stage I suggests that this 
dysregulation occurs relatively early in the progression of breast neoplasia. 

■ ■ - . 

EXAMPLE 5 

The following example illustrates the detection of the mammaglobin 
20 protein using an ahti-mammaglobin polyclonal antibody. 

The anti-mammaglobin polyclonal antibody was prepared by coupling a 
peptide corresponding to the 16 C-terminal amino acids predicted from mammaglobin 
cDNA(Glu-Val-Phe-Met-Gln-Leu-Ile-Tyr-Asp-Ser-Ser-Leu-Cys-Asp-Leu-Phe,SEQ 
ID NO.T4) to Keyhole Lymphet Hemocyanin and injecting into rabbits with Freund's 
25 adjuvant, the inoculated rabbits were boosted at three week intervals and on week 
12, the rabbits were bled and the sera was assayed for its ability to detect 
mammaglobin in serum-free conditioned medium from cultures of the breast tumor 
cell lines MDA-MB-4Y5' and MCF-7. MDA-MB-4i5 h^ beOTld^fied^ier as a 
cell line that overexpresses the mammagiobin message and MCF-7 had been 
30 identified as a cell Ime thai produces no detectable mainmaglobm mRNA. 

, The conditioned media was harvested from a 24 hr. culture and resolved 

on a 12% SDS acrylamide gel under reducing conditions (i.e., the sample was boiled 
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in buffer containing dithiothreitol (DTT) and 2-mercaptoethanol (BME) to reduce 
disulfide bonds), blotted onto a Nytran filter, and analyzed by standard Western blot 
protocols using the above described antibody to the C-terminal peptide as the primary 
antibody in this assay. After primary antibody binding, the blot was washed and 
5 secondary antibody (goat ariti-rabbit) was added. Mammaglobin-antibody complexes 
were visualized by enzyme-linked chemiluminescence (ECL Western Blotting 
Detecting Reagent, Amersham, Arlington Heights, IL). 

The anti-mammaglobin polyclonal antibody detected a band with an 
apparent molecular weight of approximately 21 kD in the conditioned media of the 
10 MDA-MB-4 1 5 cell culture (data not shown). No bands were detected in the 

conditioned medium of the MCF-7 cell culture (data not shown). Thus, consistent 
with the mRNA data, MDA-MB-41 5 cells secrete mammaglobin protein but MCF-7 
cells do not. 

The apparent molecular weight of the mammaglobin secreted into the 

1 5 MDA-MB-4 1 5 culture media is greater than the 1 0.5 kDa molecular weight 
* ■ ■ ■ i . . . ■.'*■' ■' 

calculated from the predicted amino acid sequence of SEQ ID NO:2. Since almost all 

secreted proteins are glycosylated, the cytosol of MDA-MB-4 15 cells was analyzed 

with the anti-mammaglobin polyclonal antibody to see if any precursor forms of 

secreted mammaglobin could be detected. 

20 MDA-MG-415 cells were grown for 24 hours in serum-free media, the 

culture media was collected, spun, and the resulting supernatant was collected. The 
attached cells were washed with phosphate buffered saline (PBS) and lysed with IX 
Laemmli sample buffer (2% SDS, 10% glycerol, 100 mM DTT, 60 mM Tris, pH 6.8, 
0.001% Bromophenol Blue). The lysis mixture was boiled for 5 minutes and then 

25 spun at 10,000 g for 5 minutes to pellet the cell debris. The cell lysate was transferred 
to a new tube and used for Western blot analysis as described below. 

The culture supernatant and cell lysate were run on a 12% SDS acrylamide 
gel under reducing conditions (i.e., samples boiled in buffer containing DTT and 
BME) and blotted onto a PVDF membrane using standard techniques. The blot was 

30 probed with the polyclonal antibody to the C-teiminal peptide in the presence and 
absence of the competing peptide used to generate the antibody. 
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Visualization of mammaglobin-antibody complexes were as discussed 
above. As seen in Figure 7, in the absence of competing peptide (-), the conditioned 
media (S) has the 21 kD band representative of the secreted mammaglobin protein. 
The cell lysate (C) showed a prominent band at approximately 14 kD, and several 
5 higher molecular weight bands, including one at approximately 21 kD. When the 
Western blot is performed in the presence of the competing peptide (+), the secreted 
form and intracellular forms of mammaglobin are not visualized, indicating that these 
proteins contain the peptide to which the antibody was synthesized. 

The 1 4 kD band detected only in the cell lysate likely represents a 
10 precursor, or unprocessed, form of mammaglobin. Since the predicted amino acid 
sequence for mammaglobin has the consensus N-glycosylation site, Asn-X-Thr, 
located at residues 53-55 and at residues 68-70 of SEQ ID NO:2, the observed, 
secreted 21 kD form likely represents some glycosylated form of the protein. 

This hypothesis was tested by culturing MDA-MB-415 cells in the 
1 5 presence and absence of tunicamycin, a drug that blocks N-Iinked glycosylation of 
eukaryotic proteins. Tunicamycin was added to one of two identical cultures at 1 
ug/ml and both cultures were incubated overnight for more hours. The culture media 
and cell lysate from the treated and control cultures were prepared and analyzed by 
Western blot analysis as described above. 
20 As shown in FIG 8, media from cultures (S) treated with tunicamycin (+) 

lack detectable levels of secreted mammaglobin, suggesting that secreted 
mammaglobin is glycosylated. Surprisingly, the cell cytosol form of mammaglobin 
(14 kD) was also not detectable in lysates of MDA-MB-415 cells treated with 
tunicamycin (far right lane). We hypothesize that blocking early glycosylation events 
25 with tunicamycin leads to instability and degradation of precursor forms of 

mammaglobin, thus explaining the lack of detectable 14 kD protein in the cytosol of 
tunicamycin-treated cells. 

The i>oIyciohal antibody to Ae fc-temiinal peptide of mammaglobin has 
also detected the 14 kDa precursor form of mammaglobin in cell lysates from primary 
30 human breast tumor specimens. As seen in FIG 9, the precursor form of 

mammaglobin is present in tumor specimen B023, but is undetectable in a normal 
breast tissue sample from the same patient (B022). Interestingly, some tumor samples 
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that express the mammaglobin transcript (i.e., 087R, 014, 75A and 2410) do not 
contain detectable levels of mammaglobin protein as assayed by Western blot 
analysis. One hypothesis consistent with these data is that mammaglobin expression 
is differentially regulated at the levels of transcription and translation and that this 
5 differential regulation is determined the developmental stage of the tumor. 

The anti-mammaglobin polyclonal antibody has also been used to look for 
secreted mammaglobin in breast secretions from proliferating mammary gland. 
Colostrum or mature milk fluid (500 ul samples) was collected by manual expression 
from a pregnant woman during the first and third trimester, at birth, and at day 3, 14, 

10 and 2 1 post-partum. The samples were diluted with an equal volume of 2X laemmli 
sample buffer (4% SDS, 20% glycerol, 200 mM DTT, 120 mM Tris, pH 6.8, 0.002% " 
Bromophenol Blue). The diluted samples were boiled for 5 mm. and then spun at 
10,000 g for 5 min. at 4°C to pellet cell debris. The denatured samples were 
transferred to a new tube and stored at -20°C prior to Western blot analysis as 

15 described above. 

As shown in FIG 10, the antibody detected the 21 kD secreted 
mammaglobin in breast secretions sampled during pregnancy, a period of high 
proliferation of breast epithelial cells. However, at the onset of lactation, a stage of 
breast epithelial differentiation, mammaglobin levels decreased significantly by 3 
20 days post partum and was no longer observed at 1 4 days post-partum. These results 
indicate that secreted mammaglobin is associated with proliferating breast epithelial 
cells, an observation consistent with the detection of secreted mammaglobin in human 
breast cancer. 

Reactivity with the antibody to the mammaglobin peptide has also been 
25 shown for breast tumor cells by immunohistochemical staining of paraffin-fixed 
sections of a breast cancer patient specimen (FIG 1 1). The immunohistochemical 
stammg was performed using the antibody to the mammaglobin peptide as : the 
primary antibody and then detecting the mammaglobin-antibody complex using goat 
anti-rabbit antibody tagged with horseradish peroxidase and 3, 3' diamino benzene 
30 tetrahydrochioride (DAB) as substrate. Cells expressing the mammaglobin protein 
showed a brown staining. 
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From these results, it is believed that the mamraaglobin protein is 
synthesized as a precursor protein and post-translational modifications such as N- 
linked glycosylation increase its apparent molecular weight prior to secretion; that the 
stability of precursor forms of mammoglobin is dependent on N-linked glycoslyation: 
5 and that mammaglobin protein is secreted by proliferating breast tumor cells. The 
detection of a mammaglobin protein is applicable in cancer diagnostics using the 
mammaglobin protein as a breast tumor marker, in assessing breast tumor relapse, in 
monitoring autologous bone marrow/stem cell transplants for contaminating tumor 
cells, and in targeting breast tumor cells for therapeutic intervention via antibody- 

10 mediated complexes. A purified and isolated mammaglobin polypeptide is useful for 
generating antibodies against breast tumors and in the development of other tumor- 
specific immunotherapy regimens. 

In yiew of the above, it will be seen that the several advantages of the 
invention are achieved and other advantageous results attained. 

1 5 As various changes could be made in the above methods and compositions 

without departing from the scope of the invention, it is intended that all matter 
contained in the above description and shown in the accompanying drawings shall be 
interpreted as illustrative and not in a limiting sense. 
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What is Claimed is : 

1. A mammaglobin-specific vaccine comprising an immunogenically- 
effective amount of at least one mammaglobin antigen. 

2. The vaccine of Claim 1, wherein said mammaglobin antigen is loaded 
onto a carrier cell. 

3. The vaccine of Claim 1 , further comprising a pharmaceutical!)^ 
acceptable adjuvant. 

4. The vaccine of Claim 1, comprising a mixture of mammaglobin B cell 
antigens and mammaglobin T c cell antigens. 

5: A mammaglobin-specific vaccine comprising a recombinant vector 
which comprises a nucleotide sequence encoding for expression a mammaglobin B 
cell antigen and/or a mammaglobin T c cell antigen. 

6. A method for treating a patient with a mammagiobin-expressing tumor 
which comprises administering to the patient a mammaglobin-specific vaccine 
comprising an immunogenically-effective amount of a mammaglobin antigen. 

7. The method of Claim 6, wherein said antigen is loaded onto a carrier 

cell. 

8. The method of Claim 5, wherein said vaccine also comprises a 
pharmaceutically acceptable adjuvant. 

9. The method of Claim 5, wherein said mammaglobin antigen comprises 
an amino acid sequence which is encoded for expression by a nucleotide sequence in 
a recombinant vector. 

10. A method for treating a patient with a mammglobin-expressing breast 
tumor which comprises isolating mammaglobin-specific lymphocytes from the tumor, 

activating the lymphocytes with at least one mammaglobin antigen, and 
administering the activated lymphocytes to the patient. 

1 1. An isolated and purified polypeptide comprising at least one 
mammaglobin antigen which comprises at least six contiguous amino acids of SEQ 
ID NO:2, wherein said mammaglobin antigen is recognized by B cells and/or T c cells 
specific for a naturally occurring, secreted mammaglobin polypeptide which is 
glycosylated and consists of amino acids 20-93 of SEQ ID NO:2. 

1 2. The isolated and purified polypeptide of Claim 1 1 , wherein the 
mammaglobin antigen comprises at least 12 contiguous amino acids of SEQ ID NO:2. 
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13. The isolated and purified polypeptide of Claim 12, wherein the 
mammaglobin antigen comprises at least 25 contiguous amino acids of SEQ ID NO:2. 

14. The isolated and purified polypeptide of Claim 13, wherein the 
mammaglobin antigen is glycosylated and consists of amino acids 20-93 of SEQ ID 

NO:2. 

15. The isolated and purified polypeptide of Claim 13, wherein the 
mammaglobin antigen is glycosylated and consists of SEQ ID NO:2. 

16. The isolated and purified polypeptide of Claim 11, wherein the 
mammaglobin antigen is a mammaglobin B cell antigen which induces in vitro 
activation and expansion of B cells from a breast cancer patient. 

17. The isolated and purified polypeptide of Claim 16, wherein the 
mammaglobin B cell antigen comprises a T H cell epitope from a heterologous protein. 

18. The isolated and purified polypeptide of Claim 11, wherein the 
mammaglobin antigen is a mammaglobin T c cell antigen which induces in vitro 
activation and expansion of T c cells from a breast cancer patient. 

19. The isolated and purified polypeptide of Claim 18, wherein the 
mammaglobin T c cell antigen comprises 8 or 9 contiguous amino acids of SEQ ID 
NO:2. 

20. The isolated and purified polypeptide of Claim 1 1, which comprises a 
mammaglobin B cell antigen and a mammaglobin T c cell antigen. 
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AMENDED CLAIMS 

[received by the International Bureau on 27 January 1999 (27.01 .99); 
original claim 1 amended; remaining claims unchanged (1 page)] 

1. A mammaglobin-specific vaccine comprising an immunogenically- 
effective amount of at least one mammaglobin antigen which comprises at least six 
contiguous amino acids of SEQ ID NO:2. 

2. The vaccine of Claim 1, wherein said mammaglobin antigen is loaded 
onto a carrier cell. 

3. The vaccine of Claim 1, further comprising a pharmaceutical^ 
acceptable adjuvant. 

4. The vaccine of Claim 1, comprising a mixture of mammaglobin B cell 
antigens and mammaglobin T c cell antigens. 

5. A mammaglobin-specific vaccine comprising a recombinant vector 
which comprises a nucleotide sequence encoding for expression a mammaglobin B 
cell antigen and/or a mammaglobin T c cell antigen. 

6. A method for treating a patient with a mammaglobin-expressing tumor 
which comprises administering to the patient a mammaglobin-specific vaccine 
comprising an immunogenically-effective amount of k mammaglobin antigen. 

7. The method of Claim 6, wherein said antigen is loaded onto a carrier 

cell. 

8. The method of Claim 5, wherein said vaccine also comprises a 
pharmaceutical^ acceptable adjuvant. 

9. The method of Claim 5, wherein said mammaglobin antigen comprises 
an amino acid sequence which is encoded for expression by a nucleotide sequence in 
a recombinant vector. 

1 0. A method for treating a patient with a mammglobin-expressing breast 
tumor which comprises isolating mammaglobin-specific lymphocytes from the tumor, 

activating the lymphocytes with at least one mammaglobin antigen, and 

administering the activated lymphocytes to the patient. 

1 L An isolated and purified polypeptide comprising at least one 
mammaglobin antigen which comprises at least six contiguous amino acids of SEQ 
ID NO:2, wherein said mammaglobin antigen is recognized by B cells and/or T c cells 
specific for a naturally occurring, secreted mammaglobin polypeptide which is 
glycosylated and consists of amino acids 20-93 of SEQ ID NO:2. 

12. The isolated and purified polypeptide of Claim 1 1, wherein the 
mammaglobin antigen comprises at least 12 contiguous amino acids of SEQ ID NO:2. 

AMENDED SHEET (ARTICLE 19) 



WO 99/14230 



PCT/US98/17991 



D2R 




Mammaglobin 
DDPCR Fragment 



Linker Primer 



1/12 




MB415 
Total RNA 



D2R 



^Reverse Transcriptase 



First Strand 
cDNA 



0 



T4 RNA Ligase 



Linker 



D2-102 



D2Rb 



^ Vent DNA Polymerase 



PCR Product 




Smal 
Hindlll 



Hindllj 



Figure 1 
SUBSTITUTE SHEET (RULE 26) 



WO 99/14230 



PCT/US98/17991 



2/12 

f 18 27 36 45 S4 

S' GAC AGC GGC TTC CTT GAT CCT TGC CAC COG CGA CTG AAC ACC GAC AGC AGC AGC 

^ !^ 90 99 108 

CTC ACC ATG AAG TTG CTG ATG GTC CTC ATG CTG GCG GCC CTC TGC CAG CAC TGC 
Met Lys Leu Leu Met Val Leu Met Leu Ala Ala Leu Ser Gin His Cys 
1 10 

117 126 "5 144 1S3 162 



TAC CCA GGC TCT GGC TGC CCC TTA TTG GAG AAT GTG ATT TCC AAG ACA ATC AAT 
Tyr Ala Gly Ser Gly Cys Pro Leu Leu Glu Asn Val lie Ser Lys Thr He -Asn 



20 30 
171 180 189 198 



207 216 



CCA CAA GTG TCT AAG ACT GAA TAC AAA GAA CTT CTT CAA GAG TTC ATA GAC GAC 
Pro Gin Val Ser Lys Thr Glu Tyr Lys Glu Leu Leu Gin Glu Phe lie Asp Asp 

40 SO 



22S 234 243 2S2 



261 270 



AAT GCC ACT ACA AAT GCC ATA GAT GAA TTG AAG GAA TGT TTT CTT AAC CAA ACG 
Asn Ala Thr Thr Asn Ala He Asp Glu Leu Lys Glu Cys Phe Leu Asn Gin Thr 

60 70 

279 288 297 306 31S 324 



GAT GAA ACT CTG AGC AAT GTT GAG GTG TTT ATG CAA TTA ATA TAT GAC AGC ACT 
Asp Glu Thr Leu Ser Asn Val Glu Val Phe Met Gin Leu He Tyr Asp Ser Ser 

80 



333 342 351 360 369 



378 



CTT TGT GAT TTA TTT TAA CTT TCT GCA AGA CCT TTG GCT CAC AGA ACT GCA GGG 
Leu Cys Asp Leu Phe *** 
90 



387 396 405 414 



tt — r- 



423 432 



TAT GGT GAG AAA CCA ACT ACG GAT TGC TGC AAA CCA CAC CTT CTC TTT CTT ATG 
441 4S0 <S9 4 468 477 " 48€ 



TCT TTT TAC, TAC AAA CTA CAA GAC AAT TGT TGA AAC CTG CTA TAC ATG TTT ATT 

: < 49S 



TTA ATA AAT TGA TGG CA 3 ' 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: WASHINGTON UNIVERSITY 

fii) TITLE OF INVENTION: MAMMAGLOBIN, A SECRETED 
MAMMARY SPECIFIC BREAST CANCER PROTEIN 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: ' 

(A) ADDRESSEE: HOWELL u HAFERKAMP, L.C. 

(B) STREET: 7733 FORSYTH BOULEVARD, SUITE 1400 

(C) CITY: ST. LOUIS 

(D) STATE: MISSOURI 

(E) COUNTRY: USA 

(F) ZIP: 63105-1817 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC "compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: HENDERSON, MELODIE W. 

(B) REGISTRATION NUMBER: 37,848 

(C) REFERENCE/ DOCKET NUMBER: 6029-6443 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (314) 727-5188 

(B) TELEFAX: (314) 727-6092 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTIC'S: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



< (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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GACAGCGGCT TCCTTGATCC TTGCCACCCG CGACTGAACA CCGACAGCAG CAGCCTCACC 
60 



ATGAAGTTGC TGATGGTCCT CATGCTGGCG GCCCTCTCCC AGCACTGCTA CGCAGGCTCT 
120 



GGCTGCCCCT TATTGGAGAA TGTGATTTCC AAGACAATCA ATCCACAAGT GTCTAAGACT 
180 

GAATACAAAG AACTTCTTCA AGAGTTCATA GACGACAATG CCACTACAAA TGCCATAGAT 
240 

GAATTGAAGG AATGTTTTCT TAACCAAACG GATGAAACTC TGAGCAATGT TGAGGTGTTT 
300 

ATGCAATTAA T AT AT G AC AG CAGTCTTTGT GATTTATTTT AACTTTCTGC AAGACCTTTG 
360 

GCTCACAGAA CTGCAGGGTA TGGTGAGAAA CCAACTACGG ATTGCTGCAA ACCACACCTT 
420 

CTCTTTCTTA TGTCTTTTTA CTACAAACTA CAAGACAATT GTTGAAACCT GCTATACATG 
480 

TTTATTTTAA TAAATTGATG GCA 
503 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Leu Leu Met Val Leu Met Leu Ala Ala Leu Ser Gin His Cys 
1 5 10 15 

Tyr Ala Gly Ser Gly Cys Pro Leu Leu Glu Asn Val He Ser Lys Thr 
20 25 30 

He Asn Pro Gin Val Ser Lys Thr Glu Tyr Lys Glu Leu Leu Gin Glu 

35 40 45 

Phe He Asp Asp Asn Ala Thr Thr Asn Ala lie" Asp Glu Leu Lys Glu 
50 55 60 

Cys Phe Leu Asn Gin Thr Asp Glu Thr Leu Ser Asn Val Glu Val Phe 
65 70 75 80 

: ; Met Gin Leu lie Tyr Asp Ser Ser Leu Cys Asp Leu Phe 
85 90 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAGCGGCTTC CTTGATCCTT G 
21 

(2) INFORMATION FOR SEQ ID NO:4: 

ii) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATAAGAAAGA GAAGGTGTGG 
20 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE, CHARACTERISTICS:. 

(A) LENGTH: 403 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to rnRNA'--^- : ^' f ^ 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



-4xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
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GACAGCGGCT TCCTTGATCC TTGCCACCCG CGACTGAACA CCGACAGCAG CAGCCTCACC 



60 



ATGAAGTTGC TGATGGTCCT CATGCTGGCG GCCCTCTCCC AGCACTGCTA CGCAGGCTCT 
120 

GGCTGCCCCT TATTGGAGAA TGTGATTTCC AAGACAATCA ATCCACAAGT GTCTAAGACT 
180 

GAATACAA^G AACTTCTTCA AGAGTTCATA GACGACAATG CCACTACAAA TGCCATAGAT 
240 

GAATTGAAGG AATGTTTTCT TAACCAAACG GATGAAACTC TGAGCAATGT TGAGGTGTTT 
300 

ATGCAATTAA TATATGACAG CAGTCTTTGT GATTTATTTT AACTTTCTGC AAGACCTTTG 
360 

GCTCACAGAA CTGCAGGGTA TGGTGAGAAA CCAACTACGG ATT 
403 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
<iv) ANTI-SENSE: NO 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TTTATGCAAT TAATATATGA CAGCAGTCTT TGTGATTTAT TTTAACTTTC TGCAAGACCT 
60 

TTGGCTCACA GAACTGCAGG GTATGGTGAG AAACCAACTA CGGATTGCTG CAAACCACAC 
120 

CTTCTCTTTC TTATGTCTTT TTACTACAAA CTACAAGACA ATTGTTGAAA CCTGCTATAC 
180 

ATGTTTATTT TAATAAATTG ATGGCA 
206 

(2) INFORMATION FOR SEQ ID NO: 7: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: sinale 
I (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Lys Leu Val Phe Leu Phe Leu Leu Val Thr lie Pro lie Cys Cvs 

Tyr Ala Ser Gly Ser Gly Cys Ser He Leu Asp Glu Val He Arg Gly 
20 25 30 

Thr ILe Asn Ser Thr Val Thr Leu His Asp Tyr Met Lys Leu Val Lys 
35 40 45 

Pro Tyr Val Gin Asp His Phe Thr Glu Lys Ala Val Lys Gin Phe Lys 
50 55 60 

Gin Cys Phe Leu Asp Gin Thr Asp Lys Thr Leu Glu Asn Val Gly Val 
65 70 75 80 

Met Met Glu Ala He Phe Asn Ser Glu Ser .Cys Gin Gin Pro Ser 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Lys Leu- Aia Val Thr Leu Thr Leu Val Thr Leu Ala Leu Cys Cys 
1 5 10 15 

Ser Ser Ala Ser Ala Glu He Cys Pro Ser Phe Gin Arg Val He Glu 

20 ,25 ; . 30 

Thr Leu Leu Met Asp Thr Pro Ser Ser Tyr Glu Ala Ala Met Glu Leu 
35 40 . . 45 

Phe Ser Pro Asp Gin Asp Met Arg Glu Ala Gly Ala Gin Leu Lys Lys 
5 ° 55 60 

Leu Val Asp Thr Leu Pro Gin Lys Pro Arg Glu Ser He lie Xys Leu 
65 70 75 80 

Met Glu Lys He Ala Gin Ser Ser Leu Cys Asn 
85 . 90 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 9: 

CACGAATTCA CTATCGATTC TGGAACCTTC AGAGG 
35 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTGGTTCGGC CCACCTCTGA AGGTTCCAGA ATCGATAG 
38 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO / 
(iv) ANTI-SENSE: NO 



* (xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 11: 
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AATCCGTAGT TGGTTTCTCA CC 
22 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii} HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CTTTCTGCAA GACCTTTGGC 
20 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TTTTTTTTTT TTTTTTTTTT T 
21 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Val Phe Met Gin Leu He Tyr Asp Ser Ser Leu Cys Asd Leu Phe 

1 5 10 is 
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